Clay County
Viability of perturbative expansion for quantum field theories on neurons
Accelerated progress in machine learning (ML) over the past decade has had significant impact across many research domains, including physics, and has motivated substantial interdisciplinary work. At the intersection of physics and machine learning, two prominent practical questions have emerged: 1. Can techniques from statistical mechanics and the path integral formulation of quantum field theory (QFT) help us build a theoretical understanding of how neural networks learn? 2. Can neural networks be used to facilitate computations in quantum field theory? These two questions are deeply interrelated, and will motivate the questions we explore in this work. The second question itself splits naturally into two subcategories: (a) applied machine learning for physics problems, and (b) the theoretical interplay between machine learning and QFT techniques. The area of applied ML to physics has already seen considerable progress.
- North America > United States > South Dakota > Clay County > Vermillion (0.14)
- North America > United States > Iowa > Story County > Ames (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
LakotaBERT: A Transformer-based Model for Low Resource Lakota Language
Parankusham, Kanishka, Rizk, Rodrigue, Santosh, KC
Lakota, a critically endangered language of the Sioux people in North America, faces significant challenges due to declining fluency among younger generations. This paper introduces LakotaBERT, the first large language model (LLM) tailored for Lakota, aiming to support language revitalization efforts. Our research has two primary objectives: (1) to create a comprehensive Lakota language corpus and (2) to develop a customized LLM for Lakota. We compiled a diverse corpus of 105K sentences in Lakota, English, and parallel texts from various sources, such as books and websites, emphasizing the cultural significance and historical context of the Lakota language. Utilizing the RoBERTa architecture, we pre-trained our model and conducted comparative evaluations against established models such as RoBERTa, BERT, and multilingual BERT. Initial results demonstrate a masked language modeling accuracy of 51% with a single ground truth assumption, showcasing performance comparable to that of English-based models. We also evaluated the model using additional metrics, such as precision and F1 score, to provide a comprehensive assessment of its capabilities. By integrating AI and linguistic methodologies, we aspire to enhance linguistic diversity and cultural resilience, setting a valuable precedent for leveraging technology in the revitalization of other endangered indigenous languages.
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > United States > South Dakota > Clay County > Vermillion (0.04)
- North America > United States > North Dakota (0.04)
- (2 more...)
A Reverse Mamba Attention Network for Pathological Liver Segmentation
Zeng, Jun, Jha, Debesh, Aktas, Ertugrul, Keles, Elif, Medetalibeyoglu, Alpay, Antalek, Matthew, Lewandowski, Robert, Ladner, Daniela, Borhani, Amir A., Durak, Gorkem, Bagci, Ulas
We present RMA-Mamba, a novel architecture that advances the capabilities of vision state space models through a specialized reverse mamba attention module (RMA). The key innovation lies in RMA-Mamba's ability to capture long-range dependencies while maintaining precise local feature representation through its hierarchical processing pipeline. By integrating Vision Mamba (VMamba)'s efficient sequence modeling with RMA's targeted feature refinement, our architecture achieves superior feature learning across multiple scales. This dual-mechanism approach enables robust handling of complex morphological patterns while maintaining computational efficiency. We demonstrate RMA-Mamba's effectiveness in the challenging domain of pathological liver segmentation (from both CT and MRI), where traditional segmentation approaches often fail due to tissue variations. When evaluated on a newly introduced cirrhotic liver dataset (CirrMRI600+) of T2-weighted MRI scans, RMA-Mamba achieves the state-of-the-art performance with a Dice coefficient of 92.08%, mean IoU of 87.36%, and recall of 92.96%. The architecture's generalizability is further validated on the cancerous liver segmentation from CT scans (LiTS: Liver Tumor Segmentation dataset), yielding a Dice score of 92.9% and mIoU of 88.99%. Our code is available for public: https://github.com/JunZengz/RMAMamba.
- North America > United States > Illinois > Cook County > Chicago (0.06)
- Asia > China > Chongqing Province > Chongqing (0.04)
- North America > United States > South Dakota > Clay County > Vermillion (0.04)
- Health & Medicine > Therapeutic Area > Oncology (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Sensing and Signal Processing > Image Processing (0.97)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.90)
Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery
Aryal, Shiva, Do, Tuyen, Heyojoo, Bisesh, Chataut, Sandeep, Gurung, Bichar Dip Shrestha, Gadhamshetty, Venkataramana, Gnimpieba, Etienne
In the rapidly evolving field of artificial intelligence, the ability to harness and integrate knowledge across various domains stands as a paramount challenge and opportunity. This study introduces a novel approach to cross-domain knowledge discovery through the deployment of multi-AI agents, each specialized in distinct knowledge domains. These AI agents, designed to function as domain-specific experts, collaborate in a unified framework to synthesize and provide comprehensive insights that transcend the limitations of single-domain expertise. By facilitating seamless interaction among these agents, our platform aims to leverage the unique strengths and perspectives of each, thereby enhancing the process of knowledge discovery and decision-making. We present a comparative analysis of the different multi-agent workflow scenarios evaluating their performance in terms of efficiency, accuracy, and the breadth of knowledge integration. Through a series of experiments involving complex, interdisciplinary queries, our findings demonstrate the superior capability of domain specific multi-AI agent system in identifying and bridging knowledge gaps. This research not only underscores the significance of collaborative AI in driving innovation but also sets the stage for future advancements in AI-driven, cross-disciplinary research and application. Our methods were evaluated on a small pilot data and it showed a trend we expected, if we increase the amount of data we custom train the agents, the trend is expected to be more smooth.
- North America > United States > South Dakota > Clay County > Vermillion (0.16)
- North America > United States > South Dakota > Pennington County > Rapid City (0.04)
Enhancing Bangla Fake News Detection Using Bidirectional Gated Recurrent Units and Deep Learning Techniques
Roy, Utsha, Tahosin, Mst. Sazia, Hassan, Md. Mahedi, Islam, Taminul, Imtiaz, Fahim, Sadik, Md Rezwane, Maleh, Yassine, Sulaiman, Rejwan Bin, Talukder, Md. Simul Hasan
The rise of fake news has made the need for effective detection methods, including in languages other than English, increasingly important. The study aims to address the challenges of Bangla which is considered a less important language. To this end, a complete dataset containing about 50,000 news items is proposed. Several deep learning models have been tested on this dataset, including the bidirectional gated recurrent unit (GRU), the long short-term memory (LSTM), the 1D convolutional neural network (CNN), and hybrid architectures. For this research, we assessed the efficacy of the model utilizing a range of useful measures, including recall, precision, F1 score, and accuracy. This was done by employing a big application. We carry out comprehensive trials to show the effectiveness of these models in identifying bogus news in Bangla, with the Bidirectional GRU model having a stunning accuracy of 99.16%. Our analysis highlights the importance of dataset balance and the need for continual improvement efforts to a substantial degree. This study makes a major contribution to the creation of Bangla fake news detecting systems with limited resources, thereby setting the stage for future improvements in the detection process.
- Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.05)
- North America > United States > South Dakota > Clay County > Vermillion (0.04)
- North America > United States > Illinois (0.04)
- (4 more...)
Enabling clustering algorithms to detect clusters of varying densities through scale-invariant data preprocessing
Aryal, Sunil, Wells, Jonathan R., Baniya, Arbind Agrahari, Santosh, KC
In this paper, we show that preprocessing data using a variant of rank transformation called 'Average Rank over an Ensemble of Sub-samples (ARES)' makes clustering algorithms robust to data representation and enable them to detect varying density clusters. Our empirical results, obtained using three most widely used clustering algorithms-namely KMeans, DBSCAN, and DP (Density Peak)-across a wide range of real-world datasets, show that clustering after ARES transformation produces better and more consistent results.
- Oceania > Australia (0.05)
- North America > United States > South Dakota > Clay County > Vermillion (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (2 more...)
Parkinson's Disease Detection through Vocal Biomarkers and Advanced Machine Learning Algorithms
Sayed, Md Abu, Tayaba, Maliha, Islam, MD Tanvir, Pavel, Md Eyasin Ul Islam, Mia, Md Tuhin, Ayon, Eftekhar Hossain, Nob, Nur, Ghosh, Bishnu Padh
Parkinson's disease (PD) is a prevalent neurodegenerative disorder known for its impact on motor neurons, causing symptoms like tremors, stiffness, and gait difficulties. This study explores the potential of vocal feature alterations in PD patients as a means of early disease prediction. This research aims to predict the onset of Parkinson's disease. Utilizing a variety of advanced machine-learning algorithms, including XGBoost, LightGBM, Bagging, AdaBoost, and Support Vector Machine, among others, the study evaluates the predictive performance of these models using metrics such as accuracy, area under the curve (AUC), sensitivity, and specificity. The findings of this comprehensive analysis highlight LightGBM as the most effective model, achieving an impressive accuracy rate of 96% alongside a matching AUC of 96%. LightGBM exhibited a remarkable sensitivity of 100% and specificity of 94.43%, surpassing other machine learning algorithms in accuracy and AUC scores. Given the complexities of Parkinson's disease and its challenges in early diagnosis, this study underscores the significance of leveraging vocal biomarkers coupled with advanced machine-learning techniques for precise and timely PD detection.
- North America > United States > South Dakota > Clay County > Vermillion (0.14)
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Nevada > Clark County > Las Vegas (0.04)
- (12 more...)
- Health & Medicine > Therapeutic Area > Neurology > Parkinson's Disease (1.00)
- Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Majorana Demonstrator Data Release for AI/ML Applications
Arnquist, I. J., Avignone, F. T. III, Barabash, A. S., Barton, C. J., Bhimani, K. H., Blalock, E., Bos, B., Busch, M., Buuck, M., Caldwell, T. S., Chan, Y. -D., Christofferson, C. D., Chu, P. -H., Clark, M. L., Cuesta, C., Detwiler, J. A., Efremenko, Yu., Ejiri, H., Elliott, S. R., Fuad, N., Giovanetti, G. K., Green, M. P., Gruszko, J., Guinn, I. S., Guiseppe, V. E., Haufe, C. R., Henning, R., Aguilar, D. Hervas, Hoppe, E. W., Hostiuc, A., Kidd, M. F., Kim, I., Kouzes, R. T., Lannen, T. E. V, Li, A., Lopez-Castano, J. M., Martin, R. D., Massarczyk, R., Meijer, S. J., Mertens, S., Oli, T. K., Paudel, L. S., Pettus, W., Poon, A. W. P., Quenallata, B., Radford, D. C., Reine, A. L., Rielage, K., Ruof, N. W., Schaper, D. C., Schleich, S. J., Tedeschi, D., Varner, R. L., Vasilyev, S., Watkins, S. L., Wilkerson, J. F., Wiseman, C., Xu, W., Yu, C. -H., Zhu, B. X.
The enclosed data release consists of a subset of the calibration data from the Majorana Demonstrator experiment. Each Majorana event is accompanied by raw Germanium detector waveforms, pulse shape discrimination cuts, and calibrated final energies, all shared in an HDF5 file format along with relevant metadata. This release is specifically designed to support the training and testing of Artificial Intelligence (AI) and Machine Learning (ML) algorithms upon our data. This document is structured as follows. Section I provides an overview of the dataset's content and format; Section II outlines the location of this dataset and the method for accessing it; Section III presents the NPML Machine Learning Challenge associated with this dataset; Section IV contains a disclaimer from the Majorana collaboration regarding the use of this dataset; Appendix A contains technical details of this data release. Please direct questions about the material provided within this release to liaobo77@ucsd.edu (A. Li).
- North America > United States > Washington > King County > Seattle (0.14)
- North America > United States > Tennessee > Knox County > Knoxville (0.14)
- North America > United States > South Dakota > Clay County > Vermillion (0.14)
- (21 more...)
- Research Report (0.64)
- Overview (0.54)
- Energy (1.00)
- Government > Regional Government > North America Government > United States Government (0.70)
Machine Learning Approach on Multiclass Classification of Internet Firewall Log Files
Rahman, Md Habibur, Islam, Taminul, Rana, Md Masum, Tasnim, Rehnuma, Mona, Tanzina Rahman, Sakib, Md. Mamun
Firewalls are critical components in securing communication networks by screening all incoming (and occasionally exiting) data packets. Filtering is carried out by comparing incoming data packets to a set of rules designed to prevent malicious code from entering the network. To regulate the flow of data packets entering and leaving a network, an Internet firewall keeps a track of all activity. While the primary function of log files is to aid in troubleshooting and diagnostics, the information they contain is also very relevant to system audits and forensics. Firewalls primary function is to prevent malicious data packets from being sent. In order to better defend against cyberattacks and understand when and how malicious actions are influencing the internet, it is necessary to examine log files. As a result, the firewall decides whether to 'allow,' 'deny,' 'drop,' or 'reset-both' the incoming and outgoing packets. In this research, we apply various categorization algorithms to make sense of data logged by a firewall device. Harmonic mean F1 score, recall, and sensitivity measurement data with a 99% accuracy score in the random forest technique are used to compare the classifier's performance. To be sure, the proposed characteristics did significantly contribute to enhancing the firewall classification rate, as seen by the high accuracy rates generated by the other methods.
- Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.05)
- North America > United States > South Dakota > Clay County > Vermillion (0.04)
- Asia > Indonesia (0.04)
- Information Technology > Security & Privacy (1.00)
- Government > Military > Cyberwarfare (0.48)
- Information Technology > Communications > Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Generative Adversarial Networks for Scintillation Signal Simulation in EXO-200
Li, S., Ostrovskiy, I., Li, Z., Yang, L., Kharusi, S. Al, Anton, G., Badhrees, I., Barbeau, P. S., Beck, D., Belov, V., Bhatta, T., Breidenbach, M., Brunner, T., Cao, G. F., Cen, W. R., Chambers, C., Cleveland, B., Coon, M., Craycraft, A., Daniels, T., Darroch, L., Daugherty, S. J., Davis, J., Delaquis, S., Der Mesrobian-Kabakian, A., DeVoe, R., Dilling, J., Dolgolenko, A., Dolinski, M. J., Echevers, J., Fairbank, W. Jr., Fairbank, D., Farine, J., Feyzbakhsh, S., Fierlinger, P., Fu, Y. S., Fudenberg, D., Gautam, P., Gornea, R., Gratta, G., Hall, C., Hansen, E. V., Hoessl, J., Hufschmidt, P., Hughes, M., Iverson, A., Jamil, A., Jessiman, C., Jewell, M. J., Johnson, A., Karelin, A., Kaufman, L. J., Koffas, T., Krücken, R., Kuchenkov, A., Kumar, K. S., Lan, Y., Larson, A., Lenardo, B. G., Leonard, D. S., Li, G. S., Licciardi, C., Lin, Y. H., MacLellan, R., McElroy, T., Michel, T., Mong, B., Moore, D. C., Murray, K., Njoya, O., Nusair, O., Odian, A., Perna, A., Piepke, A., Pocar, A., Retière, F., Robinson, A. L., Rowson, P. C., Runge, J., Schmidt, S., Sinclair, D., Skarpaas, K., Soma, A. K., Stekhanov, V., Tarka, M., Thibado, S., Todd, J., Tolba, T., Totev, T. I., Tsang, R.
Generative Adversarial Networks trained on samples of simulated or actual events have been proposed as a way of generating large simulated datasets at a reduced computational cost. In this work, a novel approach to perform the simulation of photodetector signals from the time projection chamber of the EXO-200 experiment is demonstrated. The method is based on a Wasserstein Generative Adversarial Network - a deep learning technique allowing for implicit non-parametric estimation of the population distribution for a given set of objects. Our network is trained on real calibration data using raw scintillation waveforms as input. We find that it is able to produce high-quality simulated waveforms an order of magnitude faster than the traditional simulation approach and, importantly, generalize from the training sample and discern salient high-level features of the data. In particular, the network correctly deduces position dependency of scintillation light response in the detector and correctly recognizes dead photodetector channels. The network output is then integrated into the EXO-200 analysis framework to show that the standard EXO-200 reconstruction routine processes the simulated waveforms to produce energy distributions comparable to that of real waveforms. Finally, the remaining discrepancies and potential ways to improve the approach further are highlighted.
- North America > United States > California > Alameda County > Berkeley (0.28)
- North America > United States > Kentucky > Fayette County > Lexington (0.14)
- North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
- (40 more...)